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ABSTRACT 

A  number  of  alternative  data-generating  processes  are  explored  for 
mail-order  demands  for  seasonal  style-goods.  Weekly  demands  for  126  items 
over  an  18-week  selling  season  provide  the  empirical  data. 

Arguments  are  presented  which  result  in  the  following  candidates  for 
data-generating  processes:   (1)  ratios  of  successive  forecasts  are 
distributed  lognormally;  (2)  ratios  of  successive  forecasts  are  distributed 
as  t_  (Student);  (3)  actual  demands  during  unequal  time  periods  are  distributed 
as  negative  binomial.   Analysis  of  the  data  suggests  the  negative  binomial 
data-generating  process  as  both  most  closely  representing  the  underlying 
process  and  being  simple  to  adapt  to  a  decision  model.   The  paper  concludes 
with  an  example  of  the  use  of  the  chosen  data-generating  process  in  a 
decision  framework,  and  deals  briefly  with  some  issues  of  implementation. 


G3G041 


I.    Introduction 

We  consider  seasonal  style  goods  with  a  limited  selling  season  which 
are  ordered  by  mail  from  a  catalog.   Our  goal  is  to  achieve  a  mathematical 
model  representing  the  demand  data-generating  process  for  such  items  over 
the  selling  season.   This  model  can  be  subsequently  combined  with  a  decision 
framework  dealing  with  when  and  how  much  to  order  of  each  item,  given 
various  costs  of  oversupply  and  stockout. 

Management  of  seasonal  style-goods  inventory  has  been  studied  by  a 
number  of  authors  (see  [2],  [3],  [6],  [7],  [8],  [9],  [12],  [16]).   A  review 
of  most  of  this  literature  is  given  in  [6]  and  will  not  be  repeated,  except 
to  summarize  that  much  of  the  previous  work  cited  presents  models  without 
testing  the  assumptions  concerning  the  data-generating  process  for  demands. 
References  [8]  and  [16]  do  deal  with  actual  data:   demands  for  women's 
apparel  at  the  retail  store  level.   These  studies  reveal  that  retail  sales 
tend  to  be  proportional  to  the  amount  of  inventory  displayed;  however,  the 
mail-order  case  never  runs  out  of  display  inventory,  and  is  thus  fundamentally 
different  from  the  retail  case.   Reference  [7]  deals  with  data  generalized 
from  wholesale  demands  placed  on  a  textile  manufacturer,  which  again  is 
quite  distinct  from  the  retail  mail-order  case. 

The  plan  of  the  paper  is  as  follows.   Sections  II,  III  and  IV  will 
describe  three  alternative  models  for  the  demand  data-generating  process, 
with  empirical  results  suggesting  the  third  model  as  being  most  appropriate. 
Section  V  combines  the  chosen  model  with  an  economic  decision  framework  for 
when  and  how  much  to  order,  and  deals  briefly  with  some  issues  of  implementation. 


In  particular,  [12]  assumes  a  continually  increasing  mean  demand  rate 
throughout  the  entire  season,  an  assumption  flatly  contradicted  by  the 
empirical  results  of  [7],  [8],  and  [lG]. 


II.   The  Lognormal  Model 

Instead  of  studying  demands  for  seasonal  goods  directly,  we  may  focus 
on  statistical  properties  of  forecasts  (and  forecast  revisions)  of  total 
seasonal  demand  for  each  item,  where  the  forecast  mechanism  is  either  a 
simple  extrapolation  of  sales-to-date  (as  in  [5]  and  [7])  or  a  more  complex 
Bayesian  combination  of  an  initial,  prior  forecast  and  experience  to  date 
(as  in  [2],  [3]  and  [9]).   Previous  empirical  work  [5]  has  indicated  that 
under  certain  circumstances,  systems  for  generating  and  revising  forecasts 
of  some  unknown  quantity  may  have  the  property  that  ratios  of  successive 
forecasts  can,  as  a  first  approximation,  be  treated  as  independent  random 
variables  distributed  according  to  the  Lognormal  distribution. 

Specifically,  we  divide  the  season  into  periods  j  =  1,...,N.   Let  X. 
represent  the  forecast  of  total  seasonal  demand  for  a  good,  with  the  forecast 
made  at  the  end  of  period  j ;  also  let  X^  represent  actual  total  demand  for 
the  good.   Define 

(1)  Z.    =  Xj+1/Xj  ,  j  =  1.....N-1  . 

Then   the  above  assumptions  may  be  restated  as   follows;   Z.    has   the 

distribution 

-(log  z.-p  )2/2<r  2 

(2)  Zi    *  WW   =  172 e  '      J   -1. •••.»-! 

with  Z  independent  of  Z,  for  j  ^  k. 

The  Data 

Before  the  above  assumptions  are  tested,  we  briefly  describe  the 
nature  of  the  data.   Weekly  mail-order  demands  for  126  separate  items  of 
women's  sportswear  were  obtained  over  an  18-week  selling  season.   The 


3 
aggregate  demand  pattern  (across  items)  over  the  season  is  graphed  in 
Figure  1. 


Sales.  =  3337  e 


-.1434(t-l) 


Figure  1:   Aggregate  Mail-Order  Demand  by  Week 


Since  aggregate  demand  in  week  one  and  in  the  latter  weeks  of  the  season 
was  quite  low,  the  weeks  were  consolidated  into  N  =  9  different  (unequal) 
time  periods  in  an  attempt  to  gain  rough  uniformity  of  the  aggregate  demand 
rate  per  time  period. 

The  lognormality  assumption  was  tested  against  forecasts  of  total 
seasonal  demand  for  each  item,  with  each  forecast  being  revised  after  each 
time  period.   The  forecasting  method  was  a  simple  extrapolation  of  item 

demands  to  date,  using  the  aggregate  percentage  of  demands  in  each  of  the 

2 
nine  periods  as  the  basis  for  extrapolation.   Then  ratios  Z  of  successive 

forecasts  of  total  demand  for  each  item  were  computed  as  in  equation  (1), 

and  the  natural  logarithm  W.  =   ln(Z.)  was  computed  for  all  items.   If 

Z  is  distributed  Lognormally,  then  by  definition  its  natural  logarithm 

is  distributed  Normally: 

-(W  -u  )2/20  * 

(3)      vWvv— W\  ,i~1 ""■• 

Figures  2,  3  and  4  illustrate  ordered  plots  of  W  ,  W2  and  Wg  on  Normal 
probability  paper.   The  probability  paper  is  constructed  such  that  a  true 
Normal  variate  would  lie  on  a  straight  line  except  for  sampling  variation. 

A  reasonable  fit  occurs  for  Wg  =  ln(Zg)  =  ln(X9/Xg)  but  the  other 
examples,  W  and  W  ,  show  an  unsatisfactory  curve  with  a  very  high  right 
tail.   (Sample  third  and  fourth  moments  also  indicate  significant  departures 
from  Normality.) 


There  is  a  minor  amount  of  circular  computing  here,  but  we  felt 
that  126  items  was  a  sufficient  number  so  that  the  effect  of  demands 
of  any  one  item  on  the  aggregate  percentages  could  be  ignored. 
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Figure   2:      W     =   log  of  X2/X1 
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Figure  3:   W2  =  log  of  X3/X2 
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Figure  4:      Wg  =   log  of  X9/X{ 


8 
The  Lognormality  assumption  implies  equal  "percentage  variability"  across 
items  selling  at  substantially  different  rates.   This  aspect  of  the  Lognormality 
assumption  was  tested  by  treating  separately  51  low-unit  volume  items  and  51 
high-unit  volume  items  (omitting  24  items  with  pre-season  sales  from  a 
previous  catalog).   Table  1  contains  the  ratio  of  the  standard  deviation  of 
W.  for  the  low-volume  items  to  the  standard  deviation  of  W.  for  the  high- 
volume  items. 


Table  1 


Ratio 


Standard  Deviation  of  W.  for  51  low-volume  items 
1 

Standard  Deviation  of  W.  for  51  high-volume  items 


J: 

12345678 

Mean 

Ratio: 

2.109     1.773     2.120     1.415      2.181     1.904     1.642      1.762 

1.863 

The  relative  variability  in  the  forecast  revision  process  for  low-volume 
items  is  substantially  larger  than  that  for  the  high-volume  items,  a  phenomenon 
frequently  found  in  demand  analyses.   Treating  all  126  items  as  being  from  a 
single  population  thus  discards  information  that  relative  variability  of 
forecast  revisions  is  inversely  related  to  the  average  demand  rate  for  an 
item. 

The  independence  assumption  (that  Z   is  independent  of  Z^   for  j  4   k) 
was  tested  by  performing  a  correlation  analysis  on  the  W  's  (see  Table  2). 
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Table  2 

Intercorrelation  Matrix  for  Logs  of 
Ratios  of  Successive  Forecasts 


Wl 

W2 

W3 

W4 

W5 

W6 

W7 

wl 

1.0000 

W2 

-0.0811 

1.0000 

W3 

-0.1214 

0.0705 

1.0000 

W4 

0.0596 

-0.2439 

-0.0692 

1.0000 

W5 

-0.0662 

-0.0943 

-0.0012 

0.1527 

1 . 0000 

W6 

0.1016 

-0.1132 

-0.0395 

0.1047 

0.1529 

1.0000 

W7 

0.0928 

0.1062 

0.0601 

-0.1984 

0.1537 

0.0391 

1.0000 

W8 

0.0058 

-0.1615 

-0.0448 

0.1468 

0.4012 

0.2728 

0.1094 

Table  2  contains  three  correlations  greater  than  0.230  in  absolute  value, 
which  represents  the  2%  significance  level  for  n  -  126.   This  would  indicate 
the  hypothesis  of  mutual  independence  of  forecast  revisions  across  time  periods 
to  be  false;  however,  no  causal  explanation  has  been  found  for  the  particular 
non-zero  correlations  (r?,,  r__,  r, „)  .   One  must  have  continuing 
systematic  departure  from  independence  in  order  to  be  able  to  make  use  of 
the  dependency.  We  therefore  interpret  the  test  of  independence  as  being 
inconclusive. 

Based  on  Figures  2  through  4  and  Table  l's  results,  the  Independent 
Lognormal  model  is  not  adequately  supported  by  this  data. 
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III.   The  Heterogeneous  Lognormal  Model  (_t  -  distribution) 

As  noted  above,  the  standard  deviation  O     of  each  of  the  W. 's  is  not 
constant  across  items.   Suppose  that  parameter  were  considered  as  a  random 
variable.   Then,  having  chosen  the  form  and  parameters  of  the  distribution 
of  a  ,  that  distribution  may  be  combined  with  the  (conditional)  Lognormal 
distribution  above,  and  integration  over  the  range  of  values  for  a  would 
produce  the  marginal  distribution  of  W..   This  precise  approach  was  recently 
used  by  Praetz  [10]  in  connection  with  the  distribution  of  stock  price 
changes . 

The  chosen  distribution  for  a.  must  be  such  that  one  can  perform  the 
indicated  integration,  and  for  this  reason  we  use  the  corresponding  Bayesian 
natural  conjugate  prior  distribution  (see  [11]).   However,  this  borrowing  of 
a  mathematically  tractable  distribution  from  Bayesian  decision  theory  does 
not  mean  that  a  Bayesian  approach  is  being  used  in  this  model.   To  the 
contrary,  we  (as  Praetz)  argue  in  this  section  that  there  is  underlying 
variability  in  the  parameter  in  question  (a.),  which  is  not  reduced  by  any 
sample  information  but  which  simply  remains  throughout  the  process. 

Specifically,  equation  (3)  is  now  interpreted  to  be  the  conditional 
probability  density  function  for  W.,  conditional  on  a  ; 

_(W -y)2/2a2 

(4)       «  '"  I  ~  ■*  - 


VwjlV  =777172 


(2Trr'  c^ 


As  a  conjugate  prior  distribution  we  follow  Praetz  [10]  and  assign  an 

2 
J  ' 


3  2 

inverted-gamma-one  distribution  on  O 


3See  [10],  p.  51,  and  [11],  p.  227. 


2  2 
/  2\      /   -,x   2.      2.      lNm  -2(m+l)  -(m-l)an  /a   /r,  . 

(5)  ivl(  j  'm»<m-l)a0  >  =  a0  (m_1)  a       e  /r(m) 

2      2  2      4 

with  E[a  ]  =  a   and  Var  [a  ]  =  a  /(m-2).   Then  the  unconditional 

distribution  of  W.  is,  following  [10],  a  Student  distribution: 

oo 

(6)  fs(W.)  =  /  fN<wj|aj)flYl(aj2|iu,(ni-l)o02)  da2 

_  _  1_  1 

=  [1+(W  -y.)2/aQ2(2m-2)]     !   r(m)  [(2m-2)Tr]2  aQ  : 

"...  this  is  at  distribution  of  2m  =  n  degrees  of  freedom,  except 

1/2 
for  a  scale  factor  [n/(n-2)]    .   Thus,  the  distribution  of 

[(W.-y.)/an]  would  be  that  of  a  scaled  t-distribution.  When  this 

distribution  is  plotted  against  a  standard  normal  distribution  for 

n  small,  it  reproduces  the  characteristic  distribution  of  share 

price  changes  described  earlier"  [10,  pp.  50-51]. 


Figures  5  and  6  contain  plots  of  scaled  t-distributions  with  n  =  4  and 
n  =  6  degrees  of  freedom  on  Normal  probability  paper.   Comparison  with 

Figures  2  through  4  indicates  that  the  shape  of  the  t-distribution  would 

4 
generally  provide  a  better  fit  to  the  W.  data  than  the  Normal  distribution. 

Praetz  fitted  t-distributions  to  logarithms  of  share  price  ratios  for 

seventeen  indices,  using  weekly  data  from  the  Sydney  Stock  Exchange.   Full 

details  are  in  [10]  but  for  the  seventeen  series,  Praetz  found  the  best 

n  value  (degrees  of  freedom)  was  3  twice,  4  twelve  times,  5  twice  and  7  once, 

Thus,  for  logarithms  of  ratios  of  successive  index  values  for  stocks,  a 


Comparison  of  Figures  5  and  6  with  Figures  8  through  10  of  [5]  also 
;ate  that  the  _t-distribution  woi 
better  than  the  Normal  distribution. 


indicate  that  the  t-distribution  would  tend  to  fit  those  sample  W  values 


^-distribution 

Normal   distribution  fitted    to 
variance    of    t-distribution 


Figure   5:      t-distribution  with  n  =   4  degrees  of    freedom 
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^-distribution  with  approximately  n  =  4  degrees  of  freedom  seems  to  fit 
the  data  quite  well,  substantially  better  than  other  candidates  such  as 
the  Normal  distribution,  the  Stable  Paretian  distribution,  and  the 
Compound  Events  distribution  [10,  p.  53]. 

With  this  background,  the  _t-distribution  was  fitted  to  our  W.  data; 
see  Appendix  1  for  the  details.  Table  3  contains  the  degrees-of-f reedom 
value  which  provided  the  best  fit  in  each  case. 

Table  3 
Degrees  of  Freedom  for  Fitted  _t-distribution 


Data: 

Wl 

W2 

W3 

W4 

W5 

W6 

W7 

W8 

Degrees  of 
Freedom  (n) : 

4 

11 

3  or  4 

6 

3 

5 

6 

6 

Thus  small  values  of  n  generally  provide  the  best  fits  for  our  data,  as  was 
found  in  [10].5 

As  compared  to  the  Independent  Lognormal  Model,  this  Heterogeneous 
Lognormal  Model  leading  to  a  ^-distribution  of  W.  fits  the  data  better. 
However,  in  contrast  to  Praetz,  we  have  already  found  a  reason  for  the 
apparent  variability  in  o  across  the  126  items  in  our  sample;  namely,  that 
relative  variability  is  inversely  related  to  the  average  demand  rate  for 
the  item.   For  this  reason,  although  the  second  model  is  an  improvement 


Values  of  n  less  than  5  may  appear  to  create  difficulties  with  the 
prior  distribution  of  a2  in  equation  (5),  since  n=4  implies  m=2  and  the 
variance  of  a2  is  then  undefined.   The  solution  is  to  use  h  =  1/a  as  a 
transformed  variable,  assessing  a  gamma-two  distribution  on  h  [11,  pp. 
226,291]  to  avoid  this  difficulty. 
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over  the  first,  we  turn  to  a  third  model  capable  of  incorporating  the 
relationship  of  relative  variability  to  average  demand  rate. 

IV.   The  Heterogeneous  Poisson  Model  (Negative-Binomial  Distribution) 

Instead  of  considering  the  statistical  behavior  of  forecasts  (and 
reforecasts)  of  total  season  demand  for  each  item,  we  now  focus  on  the 
number  of  demands  for  a  given  item  occurring  during  a  stated  number  of 
aggregate  transactions   (e.g.,  every  2000  aggregate  demands   across  all 
items).   A  simple  Poisson  model  is  first  explored,  followed  by  a  hetero- 
geneous Poisson  model. 

On  a  transactions  basis  the  underlying  characteristics  of  the  simple 
Poisson  process  seem  applicable  to  the  mail-order  situation;  units  are 
ordered  (generally)  one  at  a  time,  and  in  any  given  selling  season,  the 
average  number  of  demands  for  a  given  item  in  a  given  time  period  should 
be  approximately  proportional  to  the  total  number  of  transactions  in  the 
stated  time  period.  Moreover,  since  the  variance  equals  the  mean  in  a 
Poisson  process,  the  measure  of  relative  variability  used  in  the  W  data 
analysis  above  (the  standard  deviation  of  W.  the  logarithm  of  the  ratio 
of  successive  forecasts)  would  theoretically  decrease  as  the  average 
demand  rate  increased;  this  does  in  fact  happen,  as  indicated  in  Table  1. 
Thus,  the  simple  Poisson  model  was  a  candidate  for  analysis. 


We  consider  transactions  rather  than  calendar  time  because  as  Figure  1 
indicates,  the  aggregate  demand  rate  is  not  constant  over  the  selling  season. 
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Poisson  arrivals  imply  exponential  inter -arrival  times,  as  is  well- 
known.   If  the  time  between  demands  for  each  item  were  available,  this 
data  could  be  plotted  against  an  exponential  distribution  and  the  goodness 
of  fit  evaluated  statistically.   However,  inter-arrival  data  were  not 
available,  so  less-direct  tests  were  used. 

Table  1  showed  that  the  mean  value  across  all  periods  of  the  ratio 
(standard  deviation  of  W  for  51  low-volume  items) /(standard  deviation  of 
W.  for  51  high-volume  items)  was  1.863.   The  mean  demand  rate  for  the  51 
high-volume  items  was  304.1  and  for  the  51  low-volume  items  was  87.9.   If 
the  standard  deviation  of  the  W.'s  is  considered  to  represent  a  measure  of 
variability  in  underlying  demand  occurrences,  then  since  the  variance  equals 
the  mean  for  a  Poisson  process,  the  ratio  of  standard  deviations  (1.863) 
should  not  be  too  different  from  the  ratio  of  the  square  root  of  the  two  mean 
demand  rates:   (304.1/87.9)    =  (3.459)1/2  =  1.860.   This  result  should  be 
interpreted  as  being  consistent  with  a  Poisson  process  in  that  the  relation- 
ship of  means  and  variances  is  maintained  in  a  dichotomy  grouping. 

A  second  investigation  involved  considering  the  nine  time  periods  and 
126  items  as  1134  realizations.   If  demand  in  each  period  for  each  item  was 
Poisson,  then  the  expected  demand  for  item  i  in  period  j  would  be  estimated 
by  taking  the  total  seasonal  demand  for  item  i  and  pro-rating  that  by  the 
aggregate  percentage  of  demands  occurring  in  period  j .   Thus  if  item  5  had 
200  demands  in  total  and  if  period  6  contained  10%  of  the  total  number  of 
orders  across  all  126  items,  then  the  estimate  for  expected  demand  for  item 
51  in  period  6  would  be  200 (.10)  =  20  units. 
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For  each  of  the  1134  realizations,  an  expected  frequency  (e.g.,  20) 
was  calculated  as  indicated.   Then,  using  those  1134  separate  values  in 

turn,  the  probability  of  0,  1,  2 125  demands  occurring  was  calculated; 

and  these  probabilities  were  accumulated  by  the  number  of  occurrences. 
The  result  was  a  table  of  expected  frequency  of  cells  containing  0,  1,...,125  demands 
in  the  1134  realizations.   Appendix  2  contains  a  listing  of  expected  and 
actual  frequencies  of  the  various  counts  over  the  1134  realizations;  the 
actual  and  expected  frequencies  are  quite  similar.   The  cells  were  con- 
solidated into  27  groups  so  that  a  chi-square  test  could  be  performed, 
with  a  resulting  value  of  chi-square  of  12.753;  for  24  degrees  of  freedom' 
this  has  a  P  of  .975. 

The  above  test  is  necessary  but  not  sufficient  to  demonstrate  an 
underlying  Poisson  process.   It  is  possible  for  a  totally  different  data- 
generating  process  to  generate  demands  in  such  a  way  that  when  analyzed 
in  summary  form  as  above,  the  results  could  be  identical.   Thus,  further 
testing  was  required. 

A  third  exploration  involved  computing  the  ratio  of  equation  (7)  for 

a  random  sample  of  41  items  (1/3  of  the  data) : 

2 
,-s  _   .     (observed  -  expected) 

(7)       Ratio  = r — i 

v  '  expected 

where  observed  represents  the  realized  item  demand  of  each  of  the  item/period 

realizations  described  above,  and  expected  refers  to  the  expected  demand 


One  could  debate  what  the  appropriate  degrees  of  freedom  should  be 
after  consolidation,  but  no  matter  what  the  conclusion,  this  is  still  a 
very  low  value  for  a  chi-square  statistic. 
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calculated  in  the  pro-rata  manner  described  above.   Over  41  items  and  9 
periods  (369  observations)  the  average  value  for  the  ratio  of  equation  (7) 
was  1.194.  Now  the  expected  values  were  distributed  over  a  wide  range, 
averaging  23,591/1134  =  20.8,  and  the  skewness  (large  or  small)  of  the 
hypothesized  Poisson  distributions  makes  it  not  possible  to  make  a  signi- 
ficance test  on  the  quantity  1.194.   One  would  suspect,  however,  that 
since  the  amount  of  skewness  in  a  Poisson  process  with  parameter  X  =  20.8 
is  quite  small,  the  mean  ratio  value  of  1.194  is  suspiciously  greater  than 
one,  thereby  indicating  that  the  variance  may  be  greater  than  the  mean. 

This  leads  us  to  consider  the  heterogeneous  Poisson  process  in  which 
the  Poisson  parameter  is  treated  as  a  random  variable  with  a  gamma-one 
distribution,  resulting  in  the  unconditional  distribution  of  demands  per 
period  being  negative  binomial;  the  latter  distribution  can  represent  a 
situation  where  the  variance/mean  ratio  is  greater  than  one. 

Let  us  define  the  following  symbols: 

t.   =  number  of  aggregate  transactions  (demands)  in  period  j 

D.   =  number  of  demands  for  item  i  in  period  j  (a  random  variable) 

ij  9 

K.   =  total  seasonal  demand  for  item  i:   (K.  =  I   D..);  (a  random  variable) 

1   J-l  1J 

j 
T.   =   E  t,   cumulative  aggregate  demands  from  start  of  season 

k=l    through  period  j 
T   =  TQ  =  total  aggregate  demands  over  entire  season  for  all  items 

=  23,591 
X        =  probability  that  a  transaction  is  a  demand  for  item  i ;  A±  =  Kjl 

i 

S   =   Z  D   =  cumulative  demands  for  item  i  from  season  start 

k=l      through  period  j 
r   =  k  -  S   =  demands  for  remainder  of  season  for  item  i  (a  random  variable) 

n   =  t  -  T.  =  aggregate  transactions  for  remainder  of  season  (presumed  known) 


19 
Now  after  a  given  period  j ,  cumulative  demands  for  item  i  are  given 
as  S...  We  desire  the  probability  function  of  r   which  represents  the 
remaining  demands  for  item  i  yet  to  come  in  periods  j+l,...,N.   Presuming 

Q 

the  seasonal  pattern  is  reasonably  specified,   the  values  of  n.  are 
assumed  known.   Then,  according  to  the  Poisson  process  assumption,  the 
conditional  probability  of  r  .  remaining  demands  for  item  i  occurring  in 
the  next  n.  aggregate  transactions  is: 

(8)       fp(rij|A.n.)  =  e"X^(Xin.)r^/ri.!    . 

A  natural  conjugate  prior  distribution  [11,  p.  277]  for  X  is  the  gamma- 
one  distribution 

-XT       S   -1 

c)      VMW-  e    (x*y    " 

(s.ri)i 

with  E[X.]  =  S../T.,  Var(Xi)  =  S../(T.)2,  and  mode  (X.)  =  (S   -1)/T  . 
This  gamma-one  distribution  of  (9)  describes  our  information  relevant  to 
X.  from  a  sample  of  aggregate  transactions  of  size  T.  containing  S..  suc- 
cesses (i.e.,  demands  for  item  i) .   Combining  (8)  and  (9),  the  unconditional 
distribution  of  r.j  is  negative  binomial  [11,  p.  284]: 

(10)      P{r1.|Slj,Tj,n.}  =  /  f^  .  |  X^.  )fyl  (X  .  |  S.  .  ,T.)dX. 

■f^<rijl^  Sij} 

(r  +S   -1)1     n   rij    T   Sij 

=  iJ iJ ( J— )  ( J—) 

r  KB  -1)1  (n.+T/   lVTJ 


Such  an  assumption  is  made  in  [2],  [3],  [7],  and  [9] 
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with  Etr^]  =  (njS^)/^  and  Var(r±j)  =  [  (njS1J)/Tj  ]  •  [  (nj+T  )/T  ]  . 
The  variance  of  r.   is  equal  to  the  mean  multiplied  by  a  constant  greater 
than  one.   In  particular,  when  n  is  large  relative  to  T  ,  as  at  the 
beginning  of  the  season,  this  constant  is  much  greater  than  one,  allowing 
for  a  large  relative  initial  variance. 


Comparison  with  Beta-Binomial  Distribution 

The  negative-binomial  distribution  for  r  .  just  derived  bears  a 
close  relationship  to  the  Beta-binomial  distribution  used  by  Murray  and 
Silver  [9].   They  represent  the  demand  data-generating  process  by  a 
binomial  share-of-market  model  in  which  the  binomial  parameter  p  is 
assigned  a  Beta  prior  distribution.   The  resulting  unconditional  distri- 
bution for  demands  for  a  particular  item  during  a  specified  number  of 
transactions  is  Beta-binomial  with  parameters: 


n.S,.     .     n.S..    n.+T.    T.-S. . 

ij      T    *   "        T.      T.     T.+l 
3  rij       J       J      J 

The  means  of  the  two  distributions  are  the  same  and  the  variances  are 
almost  identical,  differing  only  by  the  last  term  representing  the  correc- 
tion for  a  finite  population;  in  our  data  the  smallest  T.  is  I.  =  approxi- 
mately 2000  transactions,  and  the  median  S   is  13.2  so  that  the  value  of 
that  last  term  would  be  0.993,  a  negligible  correction.   Computationally, 
the  Beta-binomial  model  requires  evaluation  of  the  factorial  (T.+n  -1)!, 
while  the  negative-binomial  requires  (r  +S   -1) ! ,  a  much  easier  task. 
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Figure  6  contains  a  plot  of  a  negative  binomial  distribution  on 
Normal  probability  paper  for  the  case  when  K  is  relatively  small  (39, 
or  1/4  of  median  demands  of  156),  and  for  Period  1.   Its  upward  curve 
is  able  to  capture  the  more-than-Normal  probability  of  unusually  large 
demands  accounting  for  the  skewness  present  in  all  the  previous  figures. 
The  range  of  this  curve  to  be  used  in  decision-making  will  generally  lie 
between  the  50th  and  95th  percentiles;  note  that  for  the  numerical  case 
plotted  (the  worst  for  this  approximation),  the  Normal  approximation  will 
be  too  large  in  most  cases  but  probably  not  by  enough  to  create  serious 
error.   (This  approximation  is  explored  further  in  Section  V.) 

The  use  of  equation  (10)  in  practice  would  involve  calculating  S.. 
and  T.  after  each  period  j,  estimating  (and  perhaps  re-estimating)  n.,  and 
using  (10)  to  represent  the  probability  density  function  of  remaining  de- 
mands for  item  i  during  the  rest  of  the  selling  season. 

V.    A  Decision  Model  Using  the  Negative-Binomial  Distribution 

In  theory  there  may  be  multiple  opportunities  to  place  orders  for 
stock  throughout  the  season,  although  orders  placed  later  in  the  season 
face  increasing  probabilities  of  non-fulfillment  since  manufacturers  are 
generally  working  on  the  next  season's  items  by  that  time.   A  thorough 
treatment  of  the  multiple-opportunity-to-buy  case  would  involve  a  multistage 
decision  framework  with  solution  by  dynamic  programming  as  illustrated  by 
[5]  and  [9]. 

For  the  specific  circumstances  of  the  mail-order  situation  used  to 
obtain  the  data,  it  is  possible  to  avoid  the  multistage  formulation  as 
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Figure  6:   Plot  of  Negative  Binomial  Distribution 
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There  is  a  point  in  the  season  beyond  which  one  cannot  reasonably  expect 
to  obtain  delivery  of  items  for  this  selling  season.   Delaying  the  final  com- 
mitment decreases  the  cost  of  uncertainty;  against  this  improvement  the  buyer 
must  assess  the  qualitative  cost  of  delaying  the  final  procurement,  with  its 
accompanying  risks  of  non-fulfillment,  and  select  (ahead  of  time)  the  approxi- 
mate time  in  the  season  for  the  final  procurement.  Then  that  quantity  should 
be  determined  by  the  standard  newsboy  formula,  with  appropriate  parameters; 
prior  commitments  can  be  made  essentially  according  to  economic  quantity  con- 
siderations as  long  as  they  do  not  represent  an  appreciable  probability  of 
overstocking  for  the  entire  season. 

The  error  involved  in  using  the  Normal  approximation  to  the  negative- 
binomial  distribution  when  the  latter  is  the  underlying  distribution  was 
explored  as  follows.   In  Period  1  of  our  median  numerical  example,  the 
optimal  Q  is  173  total  units  (including  25  used  to  fill  demands  in  period  1). 
The  normal  approximation  produces  an  approximate  order  quantity  of  175 
units;  the  increase  in  cost  of  uncertainty  resulting  from  this  higher-than- 
desired  inventory  level  is  $0,566,  quite  a  small  percentage  of  the  values 
in  Table  4.   Thus,  if  the  numbers  for  this  example  are  at  all  typical,  the 


The  above  methodology  is  implemented  as  follows.  Each  period 
J  =  1.....9  is  considered  in  turn  as  the  last  opportunity  to  order 
replenishment  stock,  and  the  optimal  amount  to  "order  up  to" 
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is  determined  by  the  usual  newsboy  formula: 


(11) 


*.v(Q, 


s.J 


nb  \j'n.+T.  '  ii 
1   J   J 


v"  re  C  is  the  unit  cost  of  overage,  C  the  unit  (opportunity)  cost  of 
underage,  Q.  the  optimal  inventory  level  to  order  up  to  at  the  end  of 
period  j,  and  F  ,(')  the  cumulative  negative  binomial  distribution  function 
of  remaining  demand  over  the  season  for  item  i.   Equation  (11)  was  used  to 
evaluate  optimal  inventory  for  the  numerical  case  in  which  E[K  ]  =  156  (the 
median) ,  actual  demands  D, .  follow  the  pro-rata  percent  breakdown  implied 
by  n. ,C  =  $17,  and  C  =  $7.   The  expected  cost  of  uncertainty  can  be 
estimated  [14,  pp.  131-2]  by  equation  (12): 


(12) 


Expected  Uncertainty  Cost  =  (C  +C  )  O        P(r=Q) 

rij 


Table  4  contains  these  results 


Expected  Cost  of  Uncertainty 


Period 

1 

2 

3 

4 

5 

6 

7 

8 

9 

Q  (whole  season) 

173 

166 

163 

162 

160 

160 

158 

157 

156 

lost  of  Uncertainty 

203.59 

127.16 

109.78 

87.25 

74.24 

57.15 

45.22 

25.39 

0 

larginal  Benefit  of 
)elay  for  One  Period 

76.43 

17.38 

22.53 

13.01 

17.09 

11.93 

19.83 

25.39 

— 
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The  benefit  relationship  would  be  somewhat  smoother  if  the  periods  contained 
identical  numbers  of  transactions;  also,  because  Q  is  integer  the  value  of 
P(r=Q)  is  lumpy. 

Table  4  indicates  the  approximate  magnitude  of  the  reduction  in  the 
cost  of  uncertainty  as  the  final  commitment  is  delayed.   As  stated  above, 
against  this  improvement  the  buyer  must  assesss  the  qualitative  cost  of 
delaying  the  final  procurement,  with  its  accompanying  risks  of  non-fulfill- 
ment, and  select  (ahead  of  time)  the  approximate  time  in  the  season  for  the 
final  procurement.   Then  the  optimal  order-up-to  quantity  may  be  determined 
by  equation  (11) . 

The  error  involved  in  using  the  Normal  approximation  to  the  negative- 
binomial  distribution  when  the  latter  is  the  underlying  distribution  was 
explored  as  follows.   In  Period  1  of  our  median  numerical  example,  the 
optimal  Q  is  173  total  units  (including  25  used  to  fill  demands  in  period  1). 
The  normal  approximation  produces  an  approximate  order  quantity  of  175  units; 
the  increase  in  cost  of  uncertainty  resulting  from  this  higher-than-desired 
inventory  level  is  $0,566,  quite  a  small  percentage  of  the  values  in  Table  4. 
Thus,  if  the  numbers  for  this  example  are  at  all  typical,  the 
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statistical  derivations  and  tests  notwithstanding,  a  simple  normal 
approximation  produces  decision  values  close  enough  to  the  optimal  ones 
to  ignore  the  difference. 

Issues  of  Implementation 

The  data  analyzed  above  represented  demand  by  item;  however,  the 
items  (women's  sportswear)  come  in  sizes  and  colors.   It  is  the  custom  of 
the  trade  to  forecast  by  item  and  then  to  disaggregate  the  order  into 
sizes  and  colors  using  various  ad  hoc  rules.   The  argument  given  for 
choosing  the  item  as  the  unit  for  analysis  is  that  the  distribution  of 
sizes  is  relatively  constant  and  that  sales  of  individual  stock-keeping 
units  (item  by  size  and  color)  are  highly  correlated.   Conversely,  further 
aggregation  of  items  into  classes  of  garment  (for  example)  would  not  con- 
tribute any  useful  information  since  demands  across  items  tend  to  be  quite 
variable.   The  latter  point  was  explored  in  some  detail  by  correlating  the 
values  W.  (the  logarithms  of  ratios  of  successive  forecasts  of  total 
seasonal  demand  by  item)  across  items  rather  than  across  time.   If  substi- 
tutability  or  complementarity  among  items  of  a  given  class  (e.g.,  women's 
slacks)  is  present,  the  correlation  matrix  should  illustrate  it.   There 
were  only  eight  observations.  (W  through  W„)  for  each  item  so  that  large 
correlations  could  be  expected  by  chance  alone.   Table  5  contains  the 
intercorrelation  matrix  for  12  slacks;  there  does  not  seem  to  be  any 
discernible  pattern  among  the  large  correlations  or  their  signs.   Depen- 
dencies cannot  be  exploited  if  their  occurrence  and  sign  cannot  be 
anticipated. 
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Table  5 
Intercorrelation  Matrix  for  12  Slacks  (n=8) 


1.0000 
0.22VU 
-O.I»07G 
0.5717 

-0.1o7i) 
-O.kOltii 
0.799b 
-0.82u8 

0.3HJ5 
0.325S 
-0.7225 


1.0000 
0.03U6 
0.  Ill  l»l 

-O.ltVJU 
-U.1J87 
0.1576 
■0.UJ12 
-U.Gui.7 
U.U022 
0.2Ui!ii 
-0.7003 


-95      -0.U12I) 


0.5553 
-0.7909 
0.32111 
0.3952 


0.1.393 
-0.2293 
-0.50711 

0.5069 


0. 1.331 
■O.6I1SO 
0.5U89 
0.2751 


-0.S032 
-U.56G6 
0.5116 
0.3137 
-0.7355 


•0.8919 
■0.0125 
0.81.52 


1.0000 
■0.3183 

0.6222 


Exploring  the  disaggregation  issue,  Table  6  presents  the  size  and 
color  demand  breakdown  for  six  sweaters  after  normalizing  the  figures  to 
percentages  of  each  item/ color 's  total  demand.   Item  is  denoted  by  letter 
and  color  by  number  in  the  first  column. 


Sweater  Demands  by  Color  and  Size 


Item/ Color 

Sizes 

Total  across  Size 

1 
.136 

2 

.271 

3 
.271 

4 
.322 

Al 

118 

A2 

.120 

.227 

.240 

.413 

75 

B3 

.179 

.316 

.253 

.253 

95 

B4 

.167 

.317 

.250 

.267 

60 

C5 

.146 

.214 

.301 

.340 

103 

C6 

.140 

.211 

.316 

.333 

57 

D 

.125 

.299 

.304 

.272 

184 

E 

.085 

.245 

.340 

.330 

200 

F7 

.213 

.337 

.247 

.202 

89 

F8 

.067 

.500 

.333 

.100 

30 

Mean 

.1378 

.2937 

.2855 

.2832 

Std.  Dev. 

.0406 

.0812 

.0358 

.0816 
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The  mean  size  breakdown  and  its  standard  deviation  are  also  contained 
in  the  table;  the  size  distribution  does  tend  to  remain  relatively  stable 
among  items,  with  item  F,  color  8,  being  the  most  extreme  outlier.   As 
between  colors,  indicated  by  numbers  in  the  first  column,  there  is  no_  ten- 
dency for  demands  to  be  evenly  divided  between  colors.   Thus  it  seems 
doubtful  that  much  is  gained  from  aggregating  item-color  demands  into 
item  demands;  rather,  it  is  likely  that  a  need  for  action  is  obscured 
since  low  demands  for  one  color  may  mask  high  demands  for  another.   If 
inventory  and  demand  by  item  are  assessed  they  may  appear  to  be  in 
balance  when  the  inventory  by  color  is  seriously  out  of  balance.   It  has 
frequently  been  observed  in  practice  that  there  are  oversales  of  one  stock- 
keeping  unit  and  markdowns  of  another  within  one  item,  thereby  reinforcing 
our  suspicion  that  aggregating  across  colors  can  hide  inventory  imbalance 
(relative  to  demand)  across  colors. 

Items  returned  by  the  customer  have  so  far  been  ignored.   In  the  mail- 
order situation  returns  constitute  a  significant  factor  as  they  both  indicate 
that  recent  sales  overstated  true  demand  and  they  increase  planned  inventory. 
Although  average  returns  for  the  situation  studied  were  10%,  individual 
items  had  return  rates  as  high  as  38%,  and  the  bad  cases  cause  the  worst 
losses.   We  suggest  a  simple  monitoring  scheme  be  implemented  which  considers 
the  number  of  pieces  of  an  item  "exposed  to  risk  of  return,"  recognizing 
the  delays  in  the  return  process,  and  which  creates  an  "expected  returns' 
forecast  or  allowance  which  is  updated  as  experience  is  gained  through  the 
season.   The  details  of  this  suggested  scheme  are  in  [14];  since  no  detailed 
data  were  available,  no  results  can  be  presented. 
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APPENDIX  1 

CALCULATION  TO  FIT  STUDENT'S  T  DISTRIBUTION  TO  SAKPLE  DATA 

The  procedure  uses  the  ratio  of  two  estimates  of  the  standard 
deviation  to  find  an  approximate  fit  and  then  exploits  two  values 
usually  found  in  the  tabled  t-distrihution  to  get  an  exact  fit  to 
each  period.   Define  the  terms  as  follows: 

V  .  =  the  order  statistic  closest  to  the  25th  percentile. 

Wj75=  «   ."      »        "    »  "  75th 

s  -  the  standard  deviation  of  all  the  sample  W.  .  for  period  j, 

j  being  the  period  being  fitted. 
n  =  degrees  of  freedom,  the  parameter  we  are  seeking  to  estima-te. 

If  the  sample  is  small  all  the  values  between  the  25th  percentile 
and  the  75th  should  be  plotted  on  normal  paper,  a  line  fitted,  and 

the  intersections  of  this  line  with  the  .25  and  .75  ordinates  used 

2x2 

for  ¥   oc  and  V   „.      Then    if  x  =  - }   n  =  _££ . 

*25  '75  .7413  (|vr<75  -¥>25|)  *    ~i 

n   -   co    if  x   =   1    (implying  a   Normal    curve),    and   is  undefined    if  x  <■  1 

(a  Normal    curve    is   better   than   any   t -curve).      This    formula    gives   a 

value   for   n   that    is   too   large  because   even   between   the   quartiJes   the 

slope   of  the   t-curve    is    steeper   than   the  Normal    curve.      The  value   of 

x   is  properly     ■   / . pr     where   a   is   dependent   on   n   and   is 

M'    .75  ~W. 25  l; 

2l where  b      =    the  value   found   in   the   P  =    .  50   column  of   the 


^b 
n 

t-table   for  n   degrees   of   freedom. 
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x(using  .741^  x(  exact) 

1-779 


2 

3  1.402 

4  1.264 

5  1.193 

6  1.149 

7  1.122 

8  1.102 

9  I.O85 


1.743 

1-414 

1.290 

1.225 

1.183 

1.145 

1.133 

The  value  cf  s  needed  to  construct  this  table  is  found  by 
dividing  by  two  the  value  found  in  the  P  =  .05  colun.n  of  a  two-tailed 
table  of  Student's  t-distributior.;  this  slightly  underestimates  the 
vaJue  of  s.   In  the  exact  column  is  given  the  value  of  s  found 
analytically,  which  is  /ZEZ"  %    vMch  ig  .^.^    ^   ^    ( ^ 
cannot  be  infinite  for  a  sample). 
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APPENDIX  2 
Poisson  Test 
The  table  shows  in  Col  (1)  each  potential  cell  value  from  1  to  125, 
in  Col  (2)  the  frequency  with  which  D..  actually  took  this  value,  and  in 
Col  (3)  the  theoretical  expected  frequency  calculated  on  the  Poisson 
assumption  using  parameters  calculated  from  the  sample  data. 


(1) 

(2) 

(3) 

"(1) 

(?) 

(3) 

(1) 

(?) 

(3) 

1 

13 

13.398G17 

1*1* 

7 

6.908802 

3/ 

1 

0.1*93715 

2 

21 

23.512201 

1*5 

9 

6.3871*05 

38 

0 

0.'i'(69  95 

3 

35 

32.278381 

1*6 

3 

5.888939 

89 

1 

0.1*03591* 

k 

37 

38.61*9101 

1*7 

3 

5.1*18077 

90 

0 

0.363506 

5 

l»8 

I*2.5075'»7 

1*8 

2 

l* .  9  7  8  6  7  3 

91 

0 

0.326678 

6 

l*t* 

U.  2551*  l*'i 

1*9 

2 

l*. 573301* 

92 

0 

0.2  93  02  6 

7 

l*b 

1*1*.  507031* 

50 

3 

l*. 205319 

93 

0 

0.2021*37 

8 

37 

1*3.81*2050 

51 

5 

3.871*035 

91* 

0 

0.231*771* 

9 

1*5 

1*2.681*1*60 

52 

3 

3.579723 

95 

0 

0.209865 

10 

1*1 

M.2SC583 

53 

3 

3.321196 

90 

0 

0.187536 

11 

33 

39.779001 

5i* 

l* 

3.096387 

97 

0 

0.  107581 

12 

59 

33.217130 

55 

2 

2. 902 05  8 

98 

0 

0.  11*98  11 

13 

1*0 

3G.  018088 

50 

3 

2.736818 

99 

0 

0.  131*019 

Ik 

31 

31*. 930505 

57 

3 

2.595399 

100 

2 

0.119999 

15 

35 

33.327311 

58 

l* 

2  .  1*  7 1*  8  0  3 

101 

0 

0.  10755/ 

1G 

33 

31.052  193 

59 

3 

2.371373 

102 

0 

0.090512 

17 

32 

29.979331 

00 

1 

2.281/67 

103 

0 

0. 0ob093 

18 

3'i 

23.330751* 

01 

2 

2.2  02  75  2 

101* 

0 

0. 0  7  7963 

19 

28 

20.728581 

b2 

I* 

2.  131't53 

105 

0 

0.  0/01/1* 

20 

28 

25.192585 

03 

5 

2.005371 

100 

0 

0.003221 

21 

lb 

23.73821(0 

01* 

2 

2.  0  02 'i  1*5 

107 

0 

0.  050998 

22 

19 

22.37051*1 

05 

2 

1.91(0930 

108 

0 

0.  0  5  11)1*/ 

23 

lb 

21.113023 

bb 

3 

1.8795/2 

109 

0 

0.01)01*89 

21* 

22 

19.95173  7 

07 

0 

1.8171*21 

110 

1 

0.  01)20/8 

25 

17 

18.8391*33 

08 

2 

1. 75  3  868 

111 

0 

0. 03819  3 

2b 

11 

17.921878 

69 

1 

1.68851*6 

112 

1 

0.031*791 

27 

17 

17.01*1782 

70 

0 

1.621375 

113 

0 

0. 031363 

28 

13 

10.2  39  710 

71 

1 

1.552391* 

111* 

0 

0.029396 

29 

22 

15.501*91*9 

72 

2 

1.1*8181*7 

115 

0 

0. 027365 

30 

20 

1'*.  8  2  01 3  b 

75 

1 

1.1*10025 

116 

0 

0.025773 

31 

lb 

11*. 191793 

71* 

1 

1.337292 

117 

0 

0.021*602 

32 

12 

13.591230 

75 

2 

1.261*135 

118 

0 

0.023835 

33 

10 

15.011*011* 

76 

2 

1.19091*7 

119 

0 

0.0231*59 

31* 

17 

12.1*53138 

77 

0 

1.118195 

120 

0 

0.  02  31)1*7 

35 

10 

11.8  99  801* 

78 

0 

1.01*02  3/ 

121 

0 

0.  023771* 

36 

10 

11.31*9292 

7  9 

0 

0. 975037 

122 

0 

0.  021*1*21 

3  7 

11 

10.797573 

80 

0 

0. 900032 

123 

0 

0.  02  5  31(9 

38 

12 

10.21*21*00 

31 

1 

0.839012 

121* 

0 

0. 026522 

39 

15 

9.083'iS'i 

32 

2 

0.  7  7'io90 

125 

0 

0.027905 

40 

11 

9.121028 

83 

0 

0. 71273'* 

12b 

2 

1.32329'* 

1*1 

0 

8.55921*2    . 

8U 

1 

0.053371 

12  7 

0 

1*.  '(65119 

1(2 

7 

7.999889 

85 

1 

0. 59099't 

128 

0 

O.OlitOO'i 

1*3 

7 

7.1*1*8092 

80 

0 

0.51*371*6 

129    113U 

112  7.690933 

The  cells  numbered  above  125  have  special  meaning  as  follows: 


126:   all  possible  cell  values  of  126  or  more:   It  may  be 
seen  that  D   twice  exceeded  125  and  was  expected  to 
do  so  1.82  x. 
127:   the  result  for  cell  value  zero  is  recorded  here. 
128,  129:   these  serve  for  bookkeeping  and  should  be  ignored. 
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